FFAVOD: Feature fusion architecture for video object detection
Authors
Abstract
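• We designed a novel architecture for video object detection that capitalizes on temporal information.
• We designed a fusion module to merge feature maps coming from several temporally close frames.
• We proposed an improvement to the SpotNet attention module.
• We trained and evaluated our architecture with three different base detectors on two traffic surveillance datasets.
• We demonstrated a consistent and significant improvement of our model over the baselines.

A significant amount of redundancy exists between consecutive frames of a video. Object detectors typically produce detections for one image at a time, without any capabilities for taking advantage of this redundancy. Meanwhile, many applications for object detection work with videos, including intelligent transportation systems, advanced driver assistance systems and video surveillance. Our work aims at taking advantage of the similarity between video frames to produce better detections. We propose FFAVOD, standing for feature fusion architecture for video object detection. We first introduce a novel video object detection architecture that allows a network to share feature maps between nearby frames. Second, we propose a feature fusion module that learns to merge feature maps to enhance them. We show that using the proposed architecture and the fusion module can improve the performance of three base object detectors on two object detection benchmarks containing sequences of moving road users. Additionally, to further increase performance, we propose an improvement to the SpotNet attention module. Using our architecture on the improved SpotNet detector, we obtain state-of-the-art performance on the UA-DETRAC public benchmark as well as on the UAVDT dataset. Code is available at https://github.com/hu64/FFAVOD.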
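The idea of merging feature maps from temporally close frames can be illustrated with a minimal sketch. This is not the authors' fusion module (their implementation is in the repository linked above); it assumes a much simpler scheme in which each nearby frame contributes to the fused map through a softmax-normalized weight. The names `softmax`, `fuse_feature_maps`, and `frame_scores` are hypothetical, and the scores are fixed by hand here, whereas a real module would learn its parameters.

```python
import math


def softmax(scores):
    """Turn raw per-frame scores into positive weights that sum to 1."""
    peak = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_feature_maps(feature_maps, frame_scores):
    """Merge same-shaped 2-D feature maps (one per nearby frame) into one map.

    feature_maps: list of H x W nested lists, one per frame.
    frame_scores: one raw score per frame; a hypothetical stand-in for
                  parameters that a real fusion module would learn.
    """
    if len(feature_maps) != len(frame_scores):
        raise ValueError("need exactly one score per feature map")
    weights = softmax(frame_scores)
    height, width = len(feature_maps[0]), len(feature_maps[0][0])
    fused = [[0.0] * width for _ in range(height)]
    # Accumulate the weighted contribution of every frame, pixel by pixel.
    for fmap, weight in zip(feature_maps, weights):
        for y in range(height):
            for x in range(width):
                fused[y][x] += weight * fmap[y][x]
    return fused


# Toy 2x2 feature maps for frames t-1, t, and t+1.
maps = [
    [[1.0, 0.0], [0.0, 1.0]],
    [[3.0, 0.0], [0.0, 3.0]],
    [[5.0, 0.0], [0.0, 5.0]],
]
# Equal scores reduce the fusion to a plain average of the three maps.
fused = fuse_feature_maps(maps, [0.0, 0.0, 0.0])
print(fused)
```

Raising the score of one frame pulls the fused map toward that frame's features, which is the behavior a learned weighting would exploit when some neighboring frames are more informative than others.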
Similar resources
Feature-Level based Video Fusion for Object Detection
Fusion of three-dimensional data from multiple sensors gained momentum, especially in applications pertaining to surveillance, when promising results were obtained in moving object detection. Several approaches to video fusion of visual and infrared data have been proposed in recent literature. They mainly consist of pixel-based methodologies. Surveillance is a major application of video fusio...
Multi Sensor Fusion for Object Detection Using Generalized Feature Models
This paper presents a multi-sensor tracking system and introduces the use of new generalized feature models. Detecting and recognizing objects as self-contained parts of the real world with two or more sensors of the same or of several types requires, on the one hand, fusion methods suitable for combining the data coming from the set of sensors in an optimal manner. This is realized by a sensor fusi...
Deep Spatial-Temporal Joint Feature Representation for Video Object Detection
With the development of deep neural networks, many object detection frameworks have shown great success in the fields of smart surveillance, self-driving cars, and facial recognition. However, the data sources are usually videos, and the object detection frameworks are mostly established on still images and only use the spatial information, which means that the feature consistency cannot be ens...
Video Fire Detection Algorithm using Multi-Feature Fusion
At present, moving target detection and flame characteristic extraction are the most important parts of the majority of video fire detection systems. Building on the study of these two parts, a new fire feature detection method is presented that works within a precisely located moving target area. That is, using the improved background difference method and flame features (such as the color and uniformity, Wavelet en...
Feature Detection of an Object by Image Fusion
In this paper, we propose a novel method for feature detection of an object by fusion of range and intensity images. For this purpose, we have developed a data acquisition system with a laser source and a camera interfaced with a Silicon Graphics machine. A 3-D mesh representation of the surface of the object is obtained from the acquired range images. Extraction of structural features from the range...
Journal
Journal title: Pattern Recognition Letters
Year: 2021
ISSN: 1872-7344, 0167-8655
DOI: https://doi.org/10.1016/j.patrec.2021.09.002